On Dart-Zobel Algorithm for Testing Regular 

Type Inclusion 



Lunjin Lu and John G. Cleary 

Department of Computer Science 
University of Waikato 
Hamilton, New Zealand 
Phone: +64-838-4627/4378 
{ lunj in , j cleary } @ cs . waikato . ac . nz 



Abstract. This paper answers open questions about the correctness 
and the completeness of Dart-Zobel algorithm for testing the inclusion 
relation between two regular types. We show that the algorithm is in- 
correct for regular types. We also prove that the algorithm is complete 
for regular types as well as correct for tuple distributive regular types. 
Also presented is a simplified version of Dart-Zobel algorithm for tuple 
distributive regular types. 
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1 Introduction 

Types are ubiquitous in programming languages j3J. They make pro- 
grams easier to understand and help detect errors since a large num- 
ber of errors are type errors. Types have been introduced into logic 
programming in the forms of type checking and inference [[| [7], [TT[ 
23), |8j or type analysis |22|, |9[ |17|, 0, g(| §, glj or typed lan- 



guages JT4], [T^, [25|, [27fl • Recent logic programming systems allow the 
programmer declare types for predicates and type errors are then 
detected either at compile time or at run time. Even in early logic 
programming systems, built-in predicates are usually typed and type 
checking for these predicates are performed at run time. The reader 
is referred to |24j] for more details on type in logic programming. 



A type is a possibly infinite set of ground terms with a finite 
representation. An integral part of any type system is its type lan- 
guage that specifies which sets of ground terms are types. To be 
useful, types should be closed under intersection, union and com- 
plement operations. The decision problems such as the emptiness 
of a type, inclusion of a type in another, equivalence of two types 
should be decidable. Regular term languages |T3], [|, called regular 



types, satisfy these constraints and has been used widely used as 
types |§, ^ g 0, 0, 0, [TEJ EH 0, 0, |2g, |, 



Most type systems use tuple distributive regular types which are 
strictly less powerful than regular types |2B|, fT5| [TjJ ^5], 

PI , |28[ , p| , [12| , p0| , ||, [Hj]. Tuple distributive regular types are regular 
types closed under tuple distributive closure. Intuitively, the tuple 
distributive closure of a set of terms is the set of all terms constructed 
recursively by permuting each argument position among all terms 
that have the same function symbol ||28|| . Tuple distributive regular 
types are discussed in section [| 

To our knowledge, Dart and Zobel's work || is the only one to 
present, among others, an inclusion algorithm for regular types with 
respect to a given set of type definitions without the tuple distribu- 
tive restriction. Set-based analysis can also be used to deriving types 
based on set constraint solving 0, [l], HH HI' E03- However, set con- 



straint solving methods are intended to infer descriptive types |po 
rather than for testing inclusion of a prescriptive type [25] in another. 
Therefore, they are useful in different settings from Dart-Zobel algo- 
rithm. Dart-Zobel algorithm has been used in type or type related 
analyses |7|, ||]. However, the completeness and the correctness of the 
algorithm are left open. This paper provides answers to these open 
questions. We show that the algorithm is incorrect for regular types. 
We also prove that the algorithm is complete for regular types in 
general as well as correct for tuple distributive regular types. These 
results lead to a simplified version of Dart-Zobel algorithm that is 
complete and correct for tuple distributive regular types. 

The remainder of this paper is organised as follows. Section de- 
fines regular types by regular term grammars. Section [3] recalls Dart- 
Zobel algorithm for testing if a regular type is a subset of another 
regular type. Section [| addresses the completeness and the correct- 
ness of their algorithm that have been left open. In section [|, we 
show that their algorithm is both complete and correct for tuple 
distributive regular types and provides a simplified version of their 
algorithm for tuple distributive regular types. 



2 Regular types 



Several equivalent formalisms such as tree automata [|1^, [J, regular 
term grammars |1| |6|, regular unary logic programs [2£| have been 
used to describe regular types. In ||, Dart and Zobel use regular 
term grammars to describe regular types that are sets of ground 
terms over a ranked alphabet E. 

A regular term grammar is a tuple G = (77, E, A) where@ 

— E is a fixed ranked alphabet. Each symbol in E is called a func- 
tion symbol and has a fixed arity. It is assumed that E contains 
at least one constant that is a function symbol of arity 0. 

— 77 is a set of symbols called nonterminals. These terminals will 
be called type symbols as they represent types. Type symbols are 
of arity 0. It is assumed that II n E = 0. 

— A is a set of production rules of the form a — * r with a e II and 
t eT(EUlI) where T(E U II) is the set of all terms over E U II. 
Terms in T(E U 77) will be called pure type terms. 



Example 1. Let Z 1 = {0, s(), nil, cons (,)} and 77 = {Nat, NatList}. 
G = (77, 17, A) defines natural numbers and lists of natural numbers 
where 

f Nat -> 0, 
. _ I Nat -> s(JVat), 
"~" I Natlist nil, 

{ Natlist — > cons(Nat, Natlist) t 

□ 

The above presentation is slightly different from |§ where pro- 
duction rules with the same type symbol on their lefthand sides 
are grouped together and called a type rule. For instance, produc- 
tion rules in the above examples are grouped into two type rules 
Nat — > {Nat, Natlist} and Natlist — > {mZ, cons(Nat, Natlist)}. 

Types denoted by a pure type term is given by a rewrite rule 
associated with G. t =^-q s if A contains a rule a — > r, a occurs in 
t and s results from replacing an occurrence of a in t by r. Let =^q 

1 A start symbol is not needed in our setting. 



be the reflexive and transitive closure of =>q- The type denoted by 
a pure type term r is defined as follows. 

MG = it G T(U) | r ^ t} 

[t]q is the set of terms over E that can be derived from r by re- 
peatedly replacing the lefthand side of a rule in A with its righthand 
side. 

Example 2. Let G be the regular term grammar in example |I|. We 
have 

Natlist cons(Nat, Natlist) 

cons(s(Nat), Natlist) 
=^-q cons(s(0), Natlist) 
cons (s(0), nil) 

Thus, [Natlist]^ contains cons(s(0), nil). 

The type represented by a sequence ip of pure type terms and a 
set W of sequences of pure type terms are defined as follows. 

[eh = to 
l(r) + V>1 G = Wg >< I^Ig 
[^G = U Mg 

where e is the empty sequence, + is the infix sequence concatenation 
operator, (r) is the sequence consisting of the pure type term r and 
x is the Cartesian product operator. 

The set 77 of nonterminals in Dart and Zobel's type language also 
contains constant type symbols. Constant type symbols are not de- 
fined by production rules and they denote constant types. In particu- 
lar, II contains \i denoting the set of all terms over E and <f> denoting 
the empty set of terms. We will leave out constant type symbols in 
this paper in order to simplify presentation. Re-introducing constant 
type symbols will not affect the results of the paper. 

Dart-Zobel algorithm works with simplified regular term gram- 
mars. A regular term grammar G = (77, S, A) is simplified if [o]q ^ 
for each a G 77 and r ^ 77 for each (a — > r) G A. Every regular 
grammar can be simplified. 



3 Dart-Zobel Inclusion Algorithm 



This section recalls Dart and Zobel's inclusion algorithm for reg- 
ular types. As indicated in section^], we shall disregard constant 
type symbols and simplify their algorithm accordingly. We note that 
without constant type symbols, many functions in their algorithm 
can be greatly simplified. In place of a type rule, we use the cor- 
responding set of production rules. These superficial changes don't 
change the essence of the algorithm but facilitate the presentation. 
We shall assume that G is a simplified regular term grammar and 
omit references to G where there is no confusion. 

We first describe the ancillary functions used in their algorithm. 
Let if) = tit 2 ■ ■ ■ r n be a non-empty sequence of pure type terms and 

def 

\P be a set of non-empty sequences of pure type terms. head(if)) = T\ 

and tail(ifj) d = t% ■ ■ ■ r n . heads and tails are defined as heads d — 

{head(ip) \ ip E and tails (W) = {tail(ip) \ ijj E The function 
expand rewrites a non-empty sequence into a set of sequences when 
necessary. 



expand (if)) 



def j {if)} if head(tp) g II 

~ \{(t) + tail(if)) | (head{i>) -> r) E A} if head(ip) E II 



expands^) d = U^e^ expand(ip) . 

The function s elects (i~,]£ r ) defined below applies when r is pure 
type term and t E" II and is a set of non-empty sequences with 
heads D il = 0. The output of selects (t,\P) is the set of the 
sequences in \P that have the same principal function symbol as r. 

selects(f(r 1 , ■ ■ ■ ,T n ),W) = {if) E & \ head(ip) = f(u h ■ ■ ■ ,u) n )} 

Note that /(ri, • ■ • ,r n ) is a constant when n — 0. 

The function open(if)') defined below applies when if)' is a non- 
empty sequence with head{if)') E~ II. open{if)') replaces the head of 
if)' with its arguments. 

Open(f(T! , ■ ■ ■ , T n ) + if)) = TiT 2 • • • T n + if) 

When n — 0, open(f(ri, ■ ■ • ,r n ) + if)) = if). Without constant type 
symbols, open doesn't need an extra argument as in || that is used 
to test membership of a term in a constant type and to indicate the 



required number of arguments when the constant type symbol is \i. 

def 

opens = {open(ip) \ ip G 

The inclusion algorithm subset(ri,T 2 ) takes two pure type terms 
T\ and r 2 and is intended to decide if [rj^ C [t 2 ]q is true or false. 
The core part subsetv of the inclusion algorithm takes a sequence ip 
of pure type terms and a set \P of sequences of pure type terms that 
are of the same length as ip and is intended to decide if f0]g C [P]q. 
subsetv takes a third argument C to ensure termination. C is a set 
of pairs (/3, T) where (3 G 77 is a type symbol and T C T(SUII) is a 
set of pure type terms. A pair (/?, T) in C can be read as [0\q C [7]^. 

The functions subset and subsetv are defined in the following. 
Where several alternative definitions of subsetv apply, the first is 
used. 



subset^, r 2 ) = subsetv^Tx), {(r 2 }},0) 

subsetv (ip,&,C) = 

' false if # = 

true if t/j = e 

subsetv (tailty), tails C) 

if (head(ip),r) e C and heads (V) D T 
Wifj' G expand(ijj) .subsetv (ip' , !?,CU {(head(if), heads(\P))}) 

if head(ip) G 77 
subsetv (open(i/)) , opens (selects (head (i/j) , expands (&))) , C) 

if head(ip) = f(r ir --,T n ) 

The second condition heads(\P) D T for the third alternative is 
obviously mistaken to be heads(\P) C T in p. The first two alter- 
natives deal with two trivial cases. The third alternative uses pairs 
in C to force termination. As we shall see later, this is fine for tu- 
ple distributive regular types but is problematic for regular types in 
general. The fourth alternative expands ip into a set of sequences 
ip' and compares each of them with . The fifth alternative applies 
when ip = /(ri, • - • , r n ) + t//. Sequences in \P are expanded and the 
expanded sequences of the form f(o~i, ■ ■ • , a n ) + lu' are selected, ip 
and the set of the selected sequences are then compared after re- 
placing /(ti, • • • , r n ) with Tx • • • r n in ^ and replacing • • • , cr n ) 
with ox • • • o~ n in each /(<Ji, • • • , cr„) + cj'. 



4 Correctness and Completeness 



We now address the correctness and the completeness of Dart-Zobel 
algorithm that were left open. We first show that the algorithm is 
incorrect for regular types by means of a counterexample. We then 
prove that the algorithm is complete for regular types. Thus, the 
algorithm provides an approximate solution to the inclusion problem 
of regular types in that it returns true if inclusion relation holds 
between its two arguments while the reverse is not necessarily true. 



4.1 Correctness - a counterexample 



The following example shows that Dart-Zobel algorithm is incorrect 
for regular types. 

Example 3. Let G = (II, E, A) with 77 = {a, (3, 9, a, to}, 
E = {a,b,g(),h(,)} and 



a 

P 

e - 

a 



a | h(6, a) 
b 

■ a 



h(a, b) 

| b | h(u, a) | h(u, b) 



where, for instance, 9 
— > a and 9 — > h(9, a) 



[0] G = {* e T{E h ) 
[a\ Q = {te T(E h ) 

Hg = ig(t) I * e Mg) 
\J3]g = {g(t) 1 1 e [9] G u [a] G } 



-> a | h(6, a) is an abbreviation of two rules 
Let E h = E\{h}. We have 

| t is left-skewed and leaves of t are a's} 
| t is left-skewed and leaves of t are b's} 
| t is left- skewed} 



Let t = g(h(h(a,b),a)). t e [a] G and t £ [/%. Therefore, [a] G % 
[0\q- The incorrectness of Dart-Zobel algorithm is illustrated by 
showing subset(a,/3) = true as follows. Let C = {{a, {/?})}■ We 
have 

subset(a, (3) = subsetv({a),{{P)},$) by def. of subset 

= subsetv({g(u)), {(/?)}, Co) by 4th def. of subsetv 
= subsetv ((cu) , {(9), (a)}, C ) by 5th def. of subsetv 



Let C\ = Co U {(lo, {9, a})}. By the fourth definition of subsetv 
and the above equation, 



subset(a, (3) 



( subsetv((a),{(9),(a)},C 1 )\ 
A 3ubsetv((b) t {{6),(<r)},Ci) 
A subsetv ((h(u, a)), {(6), (a)}, d) 

V A subsetv {(h{u, &)), {(6), (a)}, d) J 



By applying the fifth and then the second definitions of subsetv, 
subsetv((a) , {{9), (&)}, C%) = subsetv(e, {e}, C\) = true. In the same 
way, we obtain subsetv({a) , {(9), (a)},Ci) = true. 

subsetv((h(u, a)}, {{9}, (a)}, C x ) 

= subsetv((u, a), {(9, a), (a, b)}, Ci) by 5th def. of subsetv 
= subsetv({a) , {(a), (&)}, C\) by 3rd def. of subsetv 

= subsetv (e, {e}, C\) by 5th def. of subsetv 

= true by 2nd def. of subsetv 

We can show subsetv ({h(u, a)), {{9), {&)}, C\) = true in the same 
way as above. Therefore, by equation [I], subset(a, 0) = true and 
subset is incorrect for regular types. 
□ 

The problem with the algorithm stems from the way the set C is 
used in the third definition of subsetv. As the above example indi- 
cates, the third definition of subsetv severs the dependency between 
the terms in a tuple, i.e., subterms of a term. 

In ||, Dart and Zobel show by an example that their algorithm 
works for some regular types which are not tuple distributive. We 
don't know what is the largest subclass of the class of regular types 
for which the algorithm is correct. 



4.2 Completeness 

We now prove that Dart-Zobel algorithm is complete for regular 
types in the sense that subset(ri,T 2 ) = true whenever [rj^ C {t 2 ]q. 
Let C be a set of pairs (/?, T) with (3 E II and T C T(S U 77). A 
pair (f3, T) in C states that the denotation of (3 is included in that 
of T, i.e., [0\q C [T]q for regular types. Define 

r c ,Q = A (/3 ,r >e c[/?] G C [T] G 



The completeness of subset follows from the following theorem 
which asserts the completeness of subsetv. 

Theorem 1. Let if) be a sequence of pure type terms and \P a set of 
sequences of pure type terms of the same length as if), C a set of pairs 
((3,T) with PeLJ and T C T(E U 77). If T c G (= j^] G C [#] G then 
subsetv C) = true. 

Proof. Assume subsetv(if), \P, C) = false. The proof is done by show- 
ing r C Q \t= [i/j]q C \&]q. This is accomplished by induction on 
(dp(if),W,C),lg(if))) where lg(ifi) is the length of if) and dp(ip,W,C) 
is the depth of the computation tree for subsetv(if),^,C). Define 

(k, I) < (k', V) = J (k < k') V (k = k') A (I < V). 

Basis. dp(ip, C) = and lg(ip) = 0. ip = e an d ^ = since 
subsetv '(ip,&,C) = false. Let t = e. t G [ip] G and t £" [?^] G . So, 

r c ,G ¥= Mg c [*] g . 

Induction. dp(if), &,C) ^ or /g^) 7^ 0. By the definition of 

subsetv, 

(a) ^ = 0; or 

(b) subsetv (tail(if)), tails (\P),C) = false and there is T C T(I7 U 77) 
such that ({head(if)),T) G C) A (heads(V) D T); or 

(c) head(if)) G 17 and B.^' G expand(if)). subsetv (if)' ,\P,C) = false 
where C" = CU {(head(ip), heads(\P))}; or 

(d) head{ip) = /(ri, ■ • • , r n ) and subsetv (if)', \P',C) = false where 
■0' = open(ifj) and = opens (selects (head (if)) , expands (&))). 

It remains to prove that 7^ G ^= [0] G C [<^] G in each of the cases 
(a)-(d). The case (a) is trivial as G is simplified and hence [if)] G 7^ 0. 

In the case (b), we have dp(tail(ip), tails (\P), C) < dp(if), C) and 
lg(tail(if))) < lg(ifj). By the induction hypothesis, r cG \t= [tail(if))]Q C 
[tails(V)] G . Thus, r cG (= 3t'.(t' G [ta^(^)] G At' [tails (&)] G ). Let 
£ G [head(if))]Q and t = (t) + 1'. Note that t exists as G is simplified. 
We have r CjG |= t G [if)] G A t G* [<% So, r c>G £ {0] G C [<% 

In the case (c), dp(ip' , \P, C) < dp(ifj, \P, C). By the induction hy- 
pothesis, r c , G Y= m G C Note that r c , )G = r CiG A([«(^)] G C 
[heads(V)] G ). So, wehave r CjG h M G 2 m G V{head(if))] G % [heads (V)] G . 
Assume r cG = true. Either (i) [if)] G £ W\q or (ii) [head(if))] G % 
[heads{&)] G . In the case (i), 3.t'.(t' G [if/\ G ) A(t' £ [#] G ). By proposi- 
tion 5.26 in [|], we have r cG \£ [if)] G C [#] G . In the case (ii), 3t.(t G 



[head(ip)} G ) A (t & [heads(W)] G ). Let t' E [tail(ip)] G and t = (t) + 1'. 
Note that t' exists as G is simplified. We have t e [V>] G At ^ [#] G . 
So, r CjG ^ M G C [fl>] G in the case (c). 

In the case (d), we have ij/ = T\ • • • T n +tail(i/j) and dp(i/)', C) < 
dp{ip,V,C). By the induction hypothesis, r cG tf= [i//\ G C [W\ G . 
Thus, r cG h 3t 1 .3t 2 .(/( 7 (t 1 ) = n) A ((tj + t 2 ) e M G ) A ((ti + 
ta) [^1 G )> which implies r cG |= 3t 1 .3t 2 .((/(t 1 )) + t 2 ) e M G ) A 

(«/(ti)> + 1 2 ) £ [%). So, r CiG ^ M G c m G . 

□ 

The completeness of subset is a corollary of the above theorem. 

Corollary 2. Let n and r 2 fee pwe £?/;?e terms. If [t\] g C [r 2 ] G £/ien 
subset(ri,T2) = true. 

Proof. subset(ri,T2) = subsetv((ri), {(t 2 )}, 0) by the definition of 
sufeset. We have T 0jG (= [( n )] G C [{(r 2 )}] G since [rJ G C [r 2 ] G . The 
corollary now follows from the above theorem as JJ) G = true. 
□ 

5 Tuple Distributive Regular Types 

Most type languages in logic programming use tuple distributive 
closures of regular term languages as types |2(| |2U|, [T5], |H| 
0, 0, ||, 0, 0, |g, |, 0. The notion of tuple distributivity is 



due to Mishra |22[ . The following definition of tuple distributivity is 
due to Heintze and Jaffar [15|. Each function symbol of arity n is 
associated with n projection operators //^ , fZ> , • • • , fr}. Let S be a 

set of ground terms in T{E). fzt is defined as follows. 

f{i){S) = {U | f(h, ■■■,t i ,---,t n ) e S} 
The tuple distributive closure of S is 

S* = {c\ c <E S A c <E E } U {/(*!, •••,*„)!*<€ 

where Eq is the set of constants in E. 

The following proposition results from the fact that (.)* is a 
closure operator and preserves set inclusion, i.e., Si C S 2 implies 
S* C S 2 *. 



Propositions. Let S h S 2 C T(E). {Si U S 2 f = U S*)*. 
□ 

The tuple distributive regular type (r)^ associated with a pure 
type term r is the tuple distributive closure of the regular type [t] g 
associated with r ||22|| . 

(r) G = f Ml 

Let ip be a sequence of pure type terms, ^ be a set of sequences 
of pure type terms of the same length. 

(e) G = f {e} 

W G = {(*) + 1 I te (head(il>)) G A t G (taUW) G } 
W G = (U (headWh)* * <tote(#))G 

The definition of (&)q makes use of tuple distributivity and hence 
severs the inter-dependency between components of a sequences of 
terms. 

5.1 Correctness 

We now prove that Dart-Zobel algorithm is correct for tuple dis- 
tributive regular types in the sense that if subset(ri, t 2 ) = true then 
( t i)g — ( t 2)g- Let C be a set of pairs (/3,T) with (5 G 77 and 
T C T(Sun). A pair ((3, T) in C represents (ii) Q C (T) G for tuple 
distributive regular types. Define 

*C,G - A( A r> e c(/3>G C (T> G 

The correctness of subset follows from the following theorem which 
asserts the correctness of subsetv for tuple distributive regular types. 

Theorem 4. Let %b be a sequence of pure type terms and & a set of 
sequences of pure type terms of the same length asip, C a set of pairs 
(P, T) with (3 G 77 andT C T(E U 77). 7/ sn6seto(^, C) = true 

Proof. Assume subsetv(ip, \P, C) = true. The proof is done by induc- 
tion on (dp(ip, C),lg(-ip)). 



Basis. dp(if), \P,C) =0 and lg(if>) = 0. if) = e and \P ^ by the 
second definition of subsetv. So, \P = {e} and $ C G l = ("0)g — (^)g- 
Induction. dp(if>, \P,C) ^ or lg(if>) ^ By the definition of 

subsetv, 

(a) sub setv (tail (if)), tails(\P), C) = true and there is T C T(E U 77) 
such that ((head(ip),T) G C) A (heads(V) D T); or 

(b) head(ip) G 71 and V.?// G expand(if)).subsetv(if)',\P,C') = true 
where C" = CU {(head(ip), heads •(#'))}; or 

(c) head(ip) = f(r\, ■ • ■ ,T n ) and sub setv (if)' , SP' , C) = true where -0' = 
open(ip) and If 7 = opens(selects(head(ip) , expands (\P))). 

It remains to prove that G H (V')g — (^)g ^ n eacn °f the cases 
(a)-(c). 

In the case (a), we have dp(tail(if>), tails (\P), C) < dp(ip, \P, C) and 
lg(tail(if))) < lg(tf>). By the induction hypothesis, $ C G \ = {tail(ip)) G Q 
(tails (V)) G . ((head(ifj),T) G C) and (heads(V) D f) imply <P cG \= 
{head(ifj)) G C (Ws(^)) G . Thus, <7 V . G h W G ^ <*'>G b Y fhedef- 
initions of (V')g an d (^)g- Note that tuple distributivity is used in 
the definition of {&) G . 

In the case (b), dp(if>' ,\P,C) < dp(ip,\P,C). By the induction 
hypothesis, <1> C , G \= (ip') G C (&) G . Note that f ,. G = ^c.G A 
((W(^)) G C (heads(#)) G ). So, 

^c,G H c <#> G V ( W(^)) G ^ ( W,(^)) G 

subsetv (head (if>), heads (&),C) = true since subsetv(if),\P,C) = true. 
We have - G |= (head(if>)) G C (heads (&)) G by the induction hy- 
pothesis since dp(head(if>) , heads (W),C) < dp(if),]P,C). So, $ cG |= 
W G C («7/> g . ( ,g h (W I V>' e ex^(^)}) G C (!7/) G ) since (.)* 
is a closure operator and hence $c,G H (VOg — (^)g 

In the case (c), we have if)' = t± ■ ■ ■ r n + tail(if>) and dp(if>', W , C) < 
dp(ip,&,C). By the induction hypothesis, <P cG \= (ip') G C (\P') G . 
By proposition 5.29 in [EJ, l = (V')g — (^) G - This completes the 
proof of the theorem. 
□ 

The correctness of subset is a corollary of the above theorem. 

Corollary 5. Let T\ and T2 be pure type terms. If subset(r\,T2) = 
true then (r { ) G C (~>) G . 



Proof. Let subset{ji, r 2 ) = true. SM6seto((ri), {(r 2 )}, 0) = true by 
the definition of subset. Thus, <P^ q \= ((ti))q C ({(r 2 )})^ according 
to the above theorem. So, (ti)^ C (r 2 )^ as (Pq q = true. 
□ 

5.2 Completeness 

This section presents the completeness of Dart-Zobel algorithm for 
tuple distributive regular types. The following theorem is the coun- 
terpart of theorem [l]. 

Theorem 6. Let if) be a sequence of pure type terms and \P a set of 
sequences of pure type terms of the same length as ip, C a set of pairs 
(P,T) with Pen and TC T(EUII). If$ c ,G h Wg £ Wg then 
subsetv(i/j,\P, C) = true. 

Proof. The proof can be obtained from that for theorem |l| by simply 
replacing r._. with <?>.,. and []q with { )q. 

□ 

The following completeness result of Dart-Zobel algorithm for tu- 
ple distributive regular types follows from the above theorem. 

Corollary 7. Let T\ and r 2 be pure type terms. If (tx)q C (t 2 ))q then 
subset(Ti,T 2 ) = true. 

Proof. The proof can be obtained from that for corollary |2| by sim- 
ply replacing i~( v ) with $(.,.), with ( )q and theorem [1] with the- 
orem |6|. 
□ 

5.3 A Simplified Algorithm 

Now that Dart-Zobel algorithm is complete and correct for tuple 
distributive regular types but not correct for general regular types. 
It is desirable to specialise Dart-Zobel algorithm for tuple distribu- 
tive regular types which was originally proposed for general regular 
types. The following is a simplified version of the algorithm for tuple 
distributive regular types. 



subset' (ti,t 2 ) = subset' (ti, {r 2 },0) 

subset' \t,T,C) d = 

' false if T = 

true 

if «r,r') G C) A(TDT') 

' Vr' G eayand'(r).s'u6set / (r / ,r,CU{(r,r)}) if r G i7 

subsetv'iji ■ ■ ■ T n , {fi • • ■ fn I /(ci, • • • , &n) G expands' (T)}, C) 

if r = /( Tl) ... )Tn ) 

subsetv'(e, {e}, C) = true 

subsetv'(tp,&,C) = 

subset' (head(ip), heads C) A sub setv' (tail (i/j), tails C) 

expander) d M | W r _ >ff)e ^ } ^^ 
expands' (T) = [J expand' (r) 

While Dart-Zobel algorithm mainly deals with sequences of pure 
type terms, the simplified algorithm primarily deals with pure type 
terms by breaking a sequence of pure type terms into its component 
pure type terms. This is allowed because tuple distributive regular 
types abstract away inter-dependency between component terms in 
a sequence of ground terms. We forgo presenting the correctness and 
the completeness of the simplified algorithm because they can be 
proved by emulating proofs for theorems p] and |j. 

6 Conclusion 

We have provided answers to open questions about the correctness 
and the completeness of Dart-Zobel algorithm for testing inclusion 
of one regular type in another. The algorithm is complete but in- 
correct for general regular types. It is both complete and correct for 
tuple distributive regular types. It is our hope that the results pre- 
sented in this paper will help identify the applicability of Dart-Zobel 
algorithm. We have also provided a simplified version of Dart-Zobel 
algorithm for tuple distributive regular types. 
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